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Introduction 

The purpose of this paper is to introduce novices to the preparation and 
uses of machine readable data files (MRDF) . A fully documented machine 
readable data file consists of two basic units: a computer tape or other 
storage medium carrying the machine readable data, and documentation/ a tape 
or hard copy document that serves to communicate information about the data in 
order to permit users, uninvolved in the data collection process to gain 
access to the data file and to use all or part of the data for their own re- 
search or to compare with their research. 

The central elements of developing MRDF 1 s will be illustrated with 
examples from this investigator's current work in which data collected from 
1973 to 1977 are being prepared as a MDRF using the Statistical Package for 
the Social Sciences (SPSS) as the computer software for the task. 

The literature for preparation of machine readable data files is limited 
and largely fugitive. Most of the work done has been in the social sciences/ 
library archive preparation, or in work with data managenent systems in 
business and industry. The work reported here, and it is not as yet completed, 
relies most heavily on Robbin (1974-75) and Roistacher (1979) . The final 
product will include the major elements from these sources, adapted to the 
special needs of science education and to the SPSS software. 
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Values and Uses of MKDF'Ss 

Most research in science education is conducted by investigators gathering 
their own data, albeit on a small group of subjects, and analyzing their 
own results. More recently (Glass, 1978) has advocated the use of meta- 
analysis to combine data from many published studies as a useful research 
tool. He demonstrated the usefulness of meta-analysis in research on psycho- 
therapy (Glass and Smith, 1976) and the relationship of class size to achieve- 
ment. Other researchers have utilized meta-analysis procedures advantageously 
since Glass's procedures were published. 

Published summary statistics have limitations that also become limita- 
tions in meta-analysis procedures. Natural and social sciences researchers 
have long accumulated data, stored these in computer tapes, and made them 
available to the research community. Data tapes, such as those made available 
by government agencies such as the Census Bureau have been used extensively 
as data sources for research. The costs of securing valid data are increasing, 
and therefore, more attention needs to be given to data documentation, data 
sharing, and the multiple use of such a resource. Yet, this kind of resource 
is not readily available for science education research- 
Coordinated research programs developed and conducted by an individual 
investigator and her or his graduate students or by a group of investigators 
and students can be enhanced and facilitated by the preparation and execution 
of plans for developing and managing data files. The limitations of isolated 
empirical studies are obvious to most investigators, b~rt turnover of personnel 
and the lack of detailed documentation of work completed is an added deterrent 
to interrelating studies for the development of cumulative knowledge in 
science education. 
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Secondary analyses, made feasible by well documented data, have made 
invaluable contributions to knowledge in the social and natural science 
disciplines- They can be equally useful to the complex field of science 
education. 

Terminology 

The terminology related to machine readable data files is not yet 
completely standardized, but the terminology explained and adopted here is 
in conformity with Federal Information Processing Standards for documentation. 

Machine readable refers to files of numeric data in rectangular form 
and the documentation and formatting of those data. 

Data file is the file of numeric data recorded in rectangular form on 
magnetic tape or disc, or in columns on cards - 

User's guide is a comprehensive manual that describes the data file's 
identity, organization, contents, physical characteristics, and relation to 
computer hardware and software (Roistacher, 1979). For convenience, the 
user's guide can be subdivided into three parts: source documentation, 
codebook documentation, and technical documentation. . (Robbin, 1974-75). 

Source documentation is a description of the study and its procedures. 

Codebook documentation describes the logical structure of data items, 
their location in the collection of information, and the values ascribed to 
each data item (Robbin, 1974-75). Roistacher (1979) refers to the codebook 
as the "data dictionary." 

Technical documentation provides the information needed by users to 
access the file at their computing facility. 

These terms will be further elaborated as the development of a MRDF is 
described. 
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Data File 

The data file can be produced by using a variety of computer programs. 
The Statistical Package for the Social Sciences (SPSS) offers several 
advantages over other systems in this writer's judgment- The advantages of 

SPSS are the extensive labeling of variables and the value of variables, 

I 

the availability of "comments" that can bd included in the file, the extensive 
range of statistical analyses that can be computed with the package, and the 
provision for file manipulation, data transformation, and file management. 
SPSS is an integrated system of computer programs for the analysis of the 
kinds of data of importance in science education. The package enables the 
science educator to perform analyses through the use of natural- language 
control statements. Space does not permit an extensive discussion of the 
SPSS. Interested readers are referred to Nie, et al (1975) and Hull and 
Nie (1979) for complete information on SPSS. ¥ke ensuing discussion is based on 
the use of SPSS as a computer package for developing machine readable data 
files. 

The data file is prepared and stored as an SPSS system file. This procedure 
results in a file in which all the variables can be named and labeled, the 
values of each variable can be named and labeled, and comments can be inter- 
spersed as needed. 

The structure of an SPSS system file is based on cases, each with associated 
variables and values. All of the variables and values for each case are 
associated with identifiers for the case. Cases can be added to or deleted 
from the file. Variables can be added or deleted and new variables computed 
for each case and saved as part of the file. Cases can be sorted and vari- 
ables can be rearranged within the file. (See Appendix A for an example.) 

A critical phase in the preparation of data files is the process of 
checking and verifying the accuracy of the data before entering them 



in the SPSS system file. Once the data are in the SPSS system file, the user 
can run several kinds of accuracy checks. For example/ by using the 
FREQUENCIES program for all ID characters and for each variable / codings that 
are outside designated values can be easily spotted. Table 1 shows the out- 
put of such a run in the sex coding in a file. One keypunch error, a "3" 

Table 1. Frequencies for the two values of the sex code in 
File 5XPYR3. 

SEX 1 =MALE 2= FEMALE ! 



CATEGORY LABEL 


CODE 


ABSOLUTE 
FREQ 


RELATIVE 
FREQ 
(PCT) 


ADJUSTED 
FREQ 
(PCT) 


CUM 
FREQ 
(PCT) 


MALE 


1 


349 


48. 1 


43. 1 


48. 1 


FEMALE 


2 


376 


51 .8 


51 .9 


100.0 


OUT OF RANGE 




1 


. 1 


MISSING 






TOTAL- 


726 


TOO. 0 


100.0 





was exposed. It showed up as an "out of range" value in Table 1. This error 
could be corrected laboriously by hand or, if it was found that the correct 
code should have been a "1," the "3" could be permanently recorded as a "1," 
using the appropriate IF statement. 

To determine whether the variables and values for each case are accurate, 
the SAMPLE card can be used to specify a percentage of the cases for random 
sampling from the file. A random sample can be man using the LIST CASES 
facility, specifying all the variables for the cases. The output from this 
run can be checked to ascertain the completeness and accuracy of the file. 

User' s Guide 

The user's guide should include all of the information needed by any 
researcher to utilize the data with integrity and without the necessity for 
discussions with the file producer. For convenience of discussion and of 
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preparation and production, the user's guide can be described in three 
sections: source, codebook, and technical documentation. 
Source Documentation 

Source documentation describes the study or situation during which the 
data were gathered. It includes the theoretical framework, study design, all 
procedures and instruments used, how editing and data cleanup were accom- 
plished, and how missing data, ambiguities and marking errors were resolved. 
Any decisions made that affect the data, its completeness and quality, need to be 
provided to. the user. 

If interview data or coded data are included, instructions, procedures, 
protocols, reliability, and other pertinent data are to be reported in the 
user's guide. Definitions of- variables , how variables were generated, any 
data transformations and how they were produced are important kinds of 
information, and also important to the potential user. 

Full disclosure of procedures used to assure respondent privacy and 
confidentiality, if applicable, should be provided. It may be that some data 
were aggregated to preserve confidentiality. Users should be cautioned about 
disaggregations that could violate confidentiality of respondents 
(Roistacher, 1979) . 

Finally, if statistical output was generated, it is important to report 
how it was produced. If results of analyses using data in the file have been 
published, a bibliography of publications should be included in the user's 
guide. 

Source document material, since it is mostly prose, is generated using a 
text editor such as SCRIBE, TEXT EDITOR, and/or PROSE. The plan initially 
formulated for the current project was to compose this material using a text 
editor via time-sharing. Word processing equipment with telecommunications 
capability now seems preferable to time-sharing for this task because of the 
reduced cost due to loss on-line time. 



C odebook Documentation 

According to Rbbbin (1974-75) the ideal codebook should "stand alone" by 
describing every variable contained in the data file- Codebook documenta- 
tion includes the full text of questions, test items, interview questions 
and complete variable and value names for each data item. Ranges, means, 
and standard deviations should be provided for each variable as well as fre- 
quencies for each value of each variable. 

The SPSS subroutines, CONDESCRIPTIVE or FREQUENCIES, can be used to 
calculate the statistics for each variable. FREQUENCIES is most useful 
because it provides the number and percent of respondents selecting each 
value of each variable and descriptive statistics for the variable. A 
reference dictionary listing the variables alphabetically and by their sequence 
in the file is an option that can be output using the FREQUENCIES facility. 
Appendix B illustrates the three types of outputs availabe from FREQUENCIES 

described here. 
Technical Documentation 

This documentation provides potential users with the names of each file 
and the tape record specifications for each file: number of tracks on the 
tape (seven or nine track), density (BPI) , parity, record blocking, maximum 
block size, whether the tape is labeled, and the character code (ASCII or 
EBCDIC). In most instances, the potential user should contact her or his 
computing facility before ordering a data tape to determine how Lhe tape files 
should be written for her or his computer facility as the tape output to the 
recipient can be written to those specifications. Appendix C provides an 
example of an order form that a data file producer would provide to a potential 
user so that a machine readable data tape and user's guide could be properly 
produced. 
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Additional Assets of An SPS3 System File 



Use of an SPSS system file requires that the user have this package 
available at her or his computer facility. Without this capability, the 
data file and a printed user's guide can make the data available. However, 
the documentation, variable labels, variable names and value labels will not 
be available as part of a machine readable data record. 

With an SPSS capability, the data file can be output as card images in 
the character code required by the user. This is accomplished by using the 
WRITE CASES procedure in SPSS- Labels and comments are output by using the 
LIST FILE INFO or WRITE FILEINFO procedures. WRITE FILEINFO can produce either 
punched cards or a tape with the card images for most of the data documentation 
in the SPSS system file. The format statement from the WRITE CASES statement 
should be supplied to the user. This can be easily adapted to the INPUT 
FORMAT statement for reading the data from the cards or tape. 

LIST FILEINFO would be used alone when sending data documentation to a 
non-SPSS user. LIST FILEINFO would be used together with WRITE FILEINFO to 
produce a hard copy document of the contents of the cards or tape produced 
by the WRITE FILEINFO procedure - 

If cases need to be deleted or selected from SPSS system files, two 
procedures are available to use in conjunction with the SAVE FILE function of 
SPSS. For deleting cases, a REJECT IF statement is available on the 8.0 and 
later version of SPSS. This allows the case by case deletion from the files 
to be saved. Using the SELECT IF together with the NE (not equal) operator 
produces logic errors that result in no cases being deleted. REJECT IF 
together with the EQ (equal) operator will properly delete any cases identified. 

The SELECT IF statement, used together with the EQ (equal) operator can. 
select any subgroup of cases for any analysis desired. If the number of cases 
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is large, the REJECT IF statement can be used together with the EQ operator 
to delete a small number of cases. While a complete SPSS system file cannot 
be transferred directly . from one computer to another, the procedures listed 
above will enable the user to reconstitute the SPSS system file on her or his 
own hardware. 

Files with up to 500 variables can be processed by the standard SPSS 
program. Files of up to 1,000 variables can be processed by the large version 
of SPSS* Files of up to 5,000 variables can be produced by using the ARCHIVE 
procedures of SPSS. In addition to the large number of statistical procedures 
available in SPSS, a newly added REPORT capability (Hull and Nie, 1979) 
enables users to format tables for publications directly from the SPSS system 
file- Variables can be called by names from a system file without the need 
for specifying format information. 

Producing Machine Readable Data Files 

The use of the Statistical Package for the Social Sciences (SPSS) 
has been briefly described here as a system of computer programs that has 
advantages for producing MRDFs- This system plus text editing programs and 
the necessary hardware are all that are needed. 

A word processor that can be used as an interactive terminal offers a 
much more efficient and economical means for developing the user's guide. 
The prose sections can be stored in floppy discs of the word processor. 
Any statistical tables, for example, .those, for the codebook, could be 
generated using the SPSS and then disposed from the computer file to the word 
processor files* Editing and rearranging, and the merger of prose and tables 
can be easily accomplished on the word processor, building the final user's 
guide, which can then be disposed to and stored in the computer system files. 

Several options will be available for providing potential users with the 
data file and user's guide: both furnished on computer tape; data file on 




tape, user's guide printed out by either computer or wor-1 processor; data 
file and/or user's guide transmitted by telephone to potential user's word 
processor. 

Summary 

The purpose of this paper is to recommend the preparation and use of 
machine readable data files as a valuable tool for research in science 
education. The quality and usability of such files is dependent on the 
collection , verification , and documentation of such files- Once carefully > 
prepared, these data can be used for secondary analysis, for original 
research in asking new questions of the data, and as comparison data for 
numerous studies. 

Documentation is of critical importance. It serves four important 
functions for data by providing: 

1. a conceptual framework 

2. historical reference 

3. general instructions for communication between the producer and user 

4. reports of successful output (Robbin, 1974-75). 



I wish to acknowledge the comments and suggestions made by my colleague, 
Richard R. Tolman, on a previous draft of this paper. Many of his ideas were 
incorporated into this final paper. 
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APPENDIX "A' 



CONTENTS OF CASE NUMBER 29 



SEX 

5CH75 

SNR76 

VAR011 

VAR016 

VAR021 

VAR026 

VAR031 

VAR033 

VAIIO'H 

VAR046 

VAR051 

VAR05G 

VAROGl 

VAR0C6 

VAR071 

VAR076 

VAR08I 

VAROSG 

VAR09I ' 

VAR403 

VAR40G 

RLP002 

REP007 

REPCI2 

RLPDI7 

RCP02J 

REPQ27 

RCP032 

nCP037 

REPI03 

REP108 

REPU3 

REP113 

REP123 

REP12G 

REP133 

REP133 

REP202 

RF.P207 

REP212 

REP2I7 

PEP302 

REP307 

HEPS 12 

REP312 „. 

W ERLC 



29. 
2, 
f. 

43, 
t, 
3, 
2. 
3, 
6.' 
G. 
3. 
it 
1, 
4, 
1. 
5, 
5, 
3, 
5. 
I. 
47, 



0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 

0' 
0 
' 0 
0 
0 
0 
0 
D 



SUBFILE 
SCH74 
TEAC75 
AGE 

VAR012 
VAR017 
VAR022 
VAR027 
VAR032 
VAR037 
VAR042 
VAK047 
VARC52 
VAFiO:;? 
VA8062 
VAA0G7 
VAR072 
VAR077 
-VAR002 
VAR0H7 
VAR092 
VAR404 
VAR4C9 
REP003 
RLPOOQ 
RCP013 
lltl'010 
REP023 
REP028 
REP033 
REP038 
HEP1C4 
REP 109 
HSPM 
REM 19 
RJP124 
P.EP129 
REP134' 
REP 139 
RIIP203 
REP200 
REP213 
REP218 
REP303 
REP300 
I1EP313 
REP31B 



HSPALL 

t. 
2, 
146, 
3. 
1. 
3, 
G, 
6, 
6, 
6. 
t. 
2, 
5, 
I. 
4, 
5, 
3, 
4, 
0 

16, 
0 

3, 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 

c 

0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 



CASWGT 
TEAC74 
SNR75 
VAR003 
VAR0I3 
VAR018 
VAR023 
VAR028 
VAR033 
VAR03Q 
' VAR043 
VAR04B 
VAR053 
VAR058 
VAR063 
VA90GQ 
VAR073 
VARQ70 
VAR033 



VAR400 
VAR405 
VAR410 
REP004 
REP009 
I1EP0M 



REP034 
REP029 
REP034 
REP039 
REP105 
REP110 
REP115 
REP120 
,'2P125 
REP130 
REP135 
REP140 
REP204 
REP209 
REP2I4 
REP219 
REP304 
, REP309 
REP314 
REP319 



1,0000 
1, 
43, 
7, 
3. 
1, 
3, 
3, 
P. 



2. 
4, 
1. 
I, 
1, 
5, 
3, 
SB. 

0 

0 

1, 

0 

o 

0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 

' 0' 
0 
0 
0 
0 
0 
0 
0 
0 
0 



PERSON 


Pin 
23. 




SNR/4 




a i ft i y.Tt 


^ tit f 


\ • 






A 

3i 


VAKV i V 


VAR014 




VAr\v • 3 


VAR0I9 




WAPflOfl 
VAhU*U 


\tknt\f)A 
VAKvrf't 










VAR030 




s. 

V I 


VAR035 


VARvotf 


(S 


VAROflO 


ii i r\ \ .1 A 

VAllsM*! 


< 

1 1 




il i nt\ Art 


r 

5« 




VAR054 


A 

. ill 


VAnO J J 


VAR059 


r 

bi 




VAR054 


t. 




VAflOGB 


t. 


VAH070 


VAR074 


3, 


VAR07S 


VAR079 


5, 


VAROSO 


VAR004 


4i 


VAWS5 


VAR(!i'l9 


39. 


It t ft A !1 ft 


VAR4Q1 . 


0 


VAK402 


VAR40G 


Ii 


VAM07 


VAR411 


u 




REP005 


0 


Alifl A Af! 


REPOIO 


0 


r< e! fi A 1 1 


REP0I5 

H 1M1 ft ft A 


ft 

0 

A 




RLPOSO 

H f* Ii M ft tl 


u 
u 




RL'POjO 


0 


f\L< VOW 


RLP035 


ft 

u 


REP » 01 


ft 

0 


or d I no 


REPI06 


0 


OlTD 4 A7 

illil IV/ 


REP111 


A 

0 


Kt" ) %l 


REP11G 


A 


Her 1 1 / 


REP121 


0 




REP126 


0 


Fx 0 f\ 1 0 1 

RE? 1 27 


REP131 


0 




REP136 


0 


UEP137 


REP141 


0 


REP201 


REP205 


0' 


REP206 


REP210 


0 


/"EP211 


HEP215 


0 


REP2I6 


REP220 


0 


RSP301 


flGPSOii 


0 


REP30G 


REP310 


* 0 


REP3M 


REP315 


0 


• REP3I6 


REP320 , 


0 


REP321 



32 



0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0*' 
0 
0 
0 
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CONTENTS OF CASE NUMBER 30 



SEQNUM 


30, 


SUBFILE 


HSPALL 


CASWGT 


SEX 


2, 


5CH74 


t. 


TEAC74 


SCH7B 


1, 


TEAC75 


3. 


SNR75 


SNR76 


0 


AGE 


0 


VAR003 


VAR011 


0 


VAR012 


0 


VAR013 


VAR016 


0 


VAR017 


0 


■ VAR0I8 


VAR021 


c 


VAR022 


0 


VAR023 


VAR026 


0 


VAR027 


0 


VAR028 


VAR031 


0 


VAR032 


0 


VARQ33 


VAR036 


0 


VAR037 


0 


VAR038 


VAR041 


0 


VAR042 


• 0 


VAR043 


VAR046 


0 


VAR047 


0 


VAR041 


VAR0S1 


0 


VAR052 


0 


VAR053 


VARC56 


0 


VAR057 


0 


VAR058 


VAR051 


0 


VAR0G2 


0 


VAR063 


VAR066 


0 


VAR067' 


0 


VAROSfl 


VA3071 


0 


VAR072 


0 


VAR073 


VAR076 


0 


VAS077 


0 


VAR07Q 


VAROB1 


0 


VAR0B2 


0 


VAR083 


VAR0Q6 


0 


VAR0G7 


0 


VAR088 


VAR091 


0 


VAR092 


0 


VAR400 


VAH403 


0 


VAR404 


0 


• VAR405 


VAR408 


0 


VAR409 


0 


VAR410 


REP002 


0 


' REPGP3 


0 


REP004 



1.0000 


PERSON 


30, 


STATUS 1 


0 


\ 1. 


SKR74 


. 44. 


STATUS2 


0 


■ 44. 


SCH76 


0 


TEAC76 


0 


0 


VAR009 


0 


VASOIO 


0 


0 


VAR014 


0 


VAR0I5 


0 


0 


' VAR019 


0 


VAR020 


• 0 


0 


VARD24 


0 


VAR025 


0 


0 


VAR029 


0 


VARC30 


0 


0 


VARC34 


0 


VAR035 


0 


0 


VAR039 


0 


VAR040 


0 


0 


VAR044 


0 


VAR045 


0 


0 


VAR049 


0 


VAR050 


0 


' 0 


■ VAR054 


0 


VAR055 


0 


0 


VARC59 


0 


VAR060 


0 


0 


VAR064 


0 


VAR0S5 


0 


0 


VAR0B9 


0 


VAR070 


0 


0 


VAR074 


0 


VAR075 


0 


0 


VAR079 


0 


VAR080 


0 


0 


VAR0S4 


0 


VAR0U5 


0 


0 


VAR089 


0 


VAR090 


\ 0 


0 


VAR401 


0 


VAR402 


0* 


0 


VAR406 


0 


VAR407 


0 


0 


VAS411 


0 


REP001 


0 


0 


REP005 


0 ' 


REP006 


0 
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APPENDIX "B t! 



AGE 



AGE IN MONTHS AS OF MAY 1974 



CATEGORY LABEL 



CODE 


ABSOLUTE 
FREQ 


RELATIVE 
FREQ 
CPCT 3 


ADJUSTED 
FREQ 
(PCT> 


CUM 
FREQ 
CPCT) 




1 


. 1 


. 1 


. 1 


122. 


1 


. 1 


. 1 


.3 


124. 


1 


. 1 


. 1 


.4 


126. 


4 


.5 


. G 


1 . 0 


127. 


2 


.2 


.3 


1 .3 


128. 


1 


. 1 


. 1 


1 .4 


129. 


1 


. 1 


. 1 


1 .6 


130. 


3 


.4 


.4 


2.0 


133. 


1 


, 1 


. \ 


2. 1 


134. 


2 


.2 


.3 


2.4 


136. 


5 


.6 


.7 


3.1 


137. 


25 


3.1 


3.5 


6.6 


1O0. 


22 


2.6 


3. 1 


9.8 


139. 


30 


4.7 


5.4 


15. 1 


140. 


56 


7.0 


7.9 


23. 1 


141 . 


52 


6.5 


7.4 


30.4 


142. 


51 


6.4 


7.2 


37.6 


143. 


44 


5,5 


6.2 


43.8 


144. 


37 


4.6 


5-2 


49.1 


145. 


48 


6.0 


6.8 


55.9 


14G. 


43 


5.4 


6.1 


62.0 


147. 


42 


5.3 


5.9 


67.9 


140, 


47 


5.9 


6. C 


74.5 


149. 


37 


4.6 


5.2 

19 


79.8 



ERIC 



!S ON HSPYR1 AGE DATA 81/02/23. 10.05.10. PAGE 3 

YR1 (CREATION DATE = 81/02/20.) ADDED AGES, DELETED AND RENAMED VARIABLES 1973-74 



150. 


26 


3.3 


3.7 


83.5 


151 . 


23 


2.9 


3.3 


86.7 


152. 


12 


1.5. 


1.7 


88.4 


153. 


4 


.5 


.6 


89.0 


154. 


9 


1 . 1 


1.3 


90.2 


155. 


7 


.9 


1.0 


91 .2 


156. 


8 


1.0 


1 . 1 


92.4 


157. 


14 


1.7 


2.0 


?4.3 


158. 


14 


1.7 


2.0 


96.3 


159. 


9 


1 . 1 


1.3 


97.6 


160. 


6 


.7 


.8 


98.4 


161 . 


2 


.2 


.3 


98.7 


162. 


2 


.2 


.3 


99.0 


163. 


1 


. 1 


.1 


99.2 


164. 


1 


. 1 


.1 


99.3 


165. 


1 


. 1 


.1 


99.4 


166. 


1 


.1 


. 1 


99.6 


168. 


2 


.2 


.3 


99.9 


170. 


1 


.1 


. 1 


100.0 


0 


93 


1 1.6 


MISSING 




TOTAL 


800 


100.0 


100.0 





ERIC 



5 ON H3PYR1 AGE DATA 81/02/23. 10.05,10. 

W\ (CREATION DATE = 81/02/20.) ADDED AGES, DELETED AND RENAMED VARIABLES 1973-74 
«x**x*«* ( 14) 



PAGE 



***** ( 9) 

K**X ( 6) 

* ( 2) 

* ( 2) 

* ( 1 ) 



* ( 



1 ) 



* ( 



1 ) 



* ( 1 ) 



* ( 2) 



* ( 1) 



XXXXXXXXKXXXXXXXXXtXXXXXXXXXXXXXXXXXXXXXXXXXXXX ( 



93) 



. . I . 
20 



. . I , 
40 



..I. 
60 



..I. 
80 



, . . I 
100 



r REQUENCY 



145. 181 
140.000 

1.711 
114.000 

4.542 

707 



STD ERR 
STD DEV 
SKEWNESS 
MAXIMUM 
.95 C.I, 



.248 
6.594 

,275 
170.000 
144.694 



MEDIAN 144.635 
VARIANCE 43.486 
RANGE 56.000 
SUM 102643.000 
TO 145.668 



MISSING CASES 



93 



!2 



23 



9 

ERIC 



■S FOR SELECTED VARIABLES 



81/03/31. 10.28.51. PAGE 84 



a YR3 (CREATION DATE 2 81/02/26.) 1 975-76 HSP DATA FILE 
. INDEX 



PAGE 


VARIABLE 


PAGE 


VARIABLE 


PAGE 


VARIABLE 


PAGE 


2 


CHG202 


24 


FEL008 


45 


FEL029 


66 


3 


CHG203 


25 


FEL009 


46 


FEL030 


67 


5 


CHG204 


26 


FEL010 


47 


FEL031 


68 


6 


CHG205 


27 


FEL011 


48 


FEL032 


69 


7 


CHG206 


28 


FEL012 


49 


FEL033 


70 


8 


CHG207 


29 


FEL013 


50 


FEL034 


71 


9 


CHG208 


30 


FEL014 


51 


FEL035 


72 


10 


CHG209 


31 


FEL015 


52 


FEL036 


73 


1 1 


CHG210 


32 


FEL016 


53 


FEL037 


74 


12 


CHG21 1 


33 


FEL017 


54 


FEL038 


75 


13 


CHG212 


34 


FEL018 


55 


FEL039 


76 


14 


CHG213 


35 


FEL019 


56 


FEL040 


77 


15 


CHG214 


36 


FEL020 


57 


FEL041 


78 


16 


CHG215 


37 


FEL021 


58 


FEL042 


79 


17 


FELOOl 


38 


FEL022 


59 


FEL043 


80 


18 


FEL002 


39 


FEL023 


60 


FEL044 


81 


19 


FEL003 


40 


FEL024 


61 


FEL045 


82 


20 


FEL004 


41 


FEL025 


62 


FEL046 


83 


21 


FEL005 


42 


FEL026 


63 






22 


FEL006 


43 


FEL027 


64 






23 


FEL007 


44 


FEL028 


65 







24 

25 



ERIC 



IES FOR SELECTED VARIABLES 



61/03/31 . 10.28.51 . PAGE 65 



XPYR3 (CREATION DATE = 61/02/26.) 1975-76 HSP DATA FILE 
\C INDEX 



E PAGE 


WAR I ABLF 

vnl\l MOLL 


PAGE 


VARIABLE 


PAGE 


VARIABLE 


PAGE 


3 


CHG203 


25 


FEL009 


46 


FEL030 


67 


5 


CH0204 


26 


FEL010 


47 


FEL031 


68 


6 


CHG205 


27 


FEL01 1 


48 


FEL032 


69 


7 


CHG206 


28 


FEL012 


49 


FEL033 


70 


8 


CHG207 


29 


FEL013 


50 


FEL034 


71 


9 


CHG208 


30 


FEL014 


51 


FEL035 


72 


10 


CHG209 


31 


FEIL015 


52 


FEL036 


73 


11 


CHG210 


32 


FEL016 


53 


FEL037 


74 


12 


CKG21 1 


33 


FEIL017 


54 


FEL038 


75 


13 


CHG212 


34 


FEIL018 


55 


FEL039 


76 


14 


CHG213 


35 


FEIL019 


56 


FEL040 


77 


15 


CHG214 


36 


FEL020 


57 


FEL041 


78 


16 


CHG215 


37 


FEL021 


58 


FEL042 


79 


17 


FEL001 


38 


FE!L022 


59 


FEL043 


80 


10 


FEL002 


39 


FEL023 


60 


FEL044 


81 


19 


FEL003 


40 


FEL024 


61 


FEL045 


82 


20 


FEL004 


41 


FEL025 


62 


FEL046 


83 


21 


FEL005 


42 


FEL026 


63 


SEX 


2 


22 


FEL006 


43 


FEIL027 


64 






23 


FEL007 


44 


FEL028 


65 






24 


FEL008 


45 


FEIL029 


66 







2& 
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APPENDIX 11 C" 

A Style Manual for Machine-Readable Data Files 
and Their Documentation 



53 



EXAMPLE 14: DATA ORDER FORM , from Roistacher (1980) 



Name: 



Address: 



Tape Recording Specifications 

Seven-track Tape 

Density (BP I) 200 556 800 

Parity Even Odd 

Record blocking Blocked Unblocked 



Maximum block size 



IBM standard labels Labeled Not labeled 

1-6 character label . 

Character code BCD 

Nine-track Tape 

Density (BPI) 800 1600 6250 

Parity Odd 

Record blocking Blocked Unblocked 

Maximum block size 



IBM standard labels Labeled Not labeled 

1-6 character label 

* Character code EBCDIC 

(Over) 



Example \ki Data Order Form 
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